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Abstract 

The goal of this paper is to survey the history, development and cur- 
rent status of the Return Times Theorem and its many extensions and 
variations. Let (X, IF, fj.) be a finite measure space and let T : X — > X 
be a measure preserving transformation. Perhaps the oldest result in er- 
godic theory is that of Poincare's Recurrence Principle [Poi87] which 
states: 

Theorem 0.1. For any set Ac J, the set of points x of A such that 
T n x A for all n > has zero measure. This says that almost every 
point of A returns to A. In fact, almost every point of A returns to A 
infinitely often. 

The return time for a given element x £ A, 

r A (x) = inf{fc > 1 : T k x G A}, 

is the first time that the element x returns to the set A. This is visualized 
in the figure below. 




• T' 2 x 



By Theorem |0.1| there is set of full measure in A such that all 
elements of this set have a finite return time. Our study of the return 
times theorem asks how we can further generalize this notion. 
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1 Origins 

As stated above in Theorem |0.1[ /i-a.e. x returns to A infinitely often. One 
question to ask is how frequently this occurs. Consider the time average 

1 N 

n=l 

This is a measure of how frequently the element x returns to A. We can then 
define a sequence recursively which characterizes the times that x returns to A. 

Definition 1.1. The return times sequence for x with respect to the 

set A is defined as 

ki(x,A) = inf{n : T n x € A} 

and 

ki(x,A) = mi{n > fc i _ 1 (x,A) : T n x e A}. 
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1 N 

Thus the time average — Xa{T u x) is simply a measure of the density of 



=1 



the sequence ki(x,A) in the set {1, ...,N}. 

Recall the following definition of an ergodic measure-preserving system. 

Definition 1.2. The finite measure-preserving system (X, J 7 , T, u.) is ergodic 
if T- 1 (B) = B for B e T implies (i(B) = or fx(B) = (i(X). In this case T is 
called an ergodic transformation. 

Using Birkhoff's Pointwise Ergodic Theorem Bir31j . we know some- 
thing more about the density of this sequence. 

Theorem 1.3. If T is a measure preserving map on the finite measure space 
(X, T, fi) then for each real-valued f € L l (u), 



1 N 

l Tx^ f {Tnx) = E (f\ Inv )( x )>» ~ a - e - 



where Inv = {B G J- : T^ 1 (B) = B}. In particular, the limiting function has 
the same integral as f and E(/|7ra;)(a;) = J fd/i for all x € X when T is 
ergodic. 

For a given factor y of J 7 , the notation E(/|^) or Py refers to the condi- 
tional expectation of / with respect to the factor y. It should be noted that 
Birkhoff's original proof is when / is a characteristic function (as in the case we 
are considering above) in the setting of a closed analytic manifold having a finite 
invariant measure. Khintchine Khi33 showed that Theorem 



1.3 



remains true 

for an intcgrable / on an abstract finite measure space. The essence of the proof 
of Khintchine is that found in Birkhoff's proof. 

Applying Theorem |1.3| to our time average above, when T is ergodic then 
for /i-a.e. x € X 

1 N f 

n=l ^ X 

Thus, whenever A has positive measure the sequence ki(x,A) has positive den- 
sity for almost every x € X. 



1.1 Averages along Subsequences 

The study of time averages can be extended by looking at the convergence of 
the averages along some sequence kf. 

1 - 

hm-]T 5 (SS). (1-1) 

t=l 

In this case we want to know what properties of the ki would guarantee a.e. 
convergence. In studying such sequences we can consider cither deterministic 
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sequences such as the sequence of squares or primes as well as randomly gener- 
ated sequences such as those that come from looking at the values n for which 
Xa(T u x) — 1. We are interested in finding when a randomly generated sequence 



will give convergence of the averages in equation (1.1 ). 

In 1969, A. Brunei and M. Keane BK69 use sequences of this type to deter- 
mine a more powerful result concerning time averages along randomly generated 
subsequences. The result of interest is Theorem 1 on page 3 of their paper. They 
follow up this result by looking at norm convergence of similar averages as well. 
It has been indicated by Michael Keane that the main idea for this return times 
average was a part of Antoine Brunei's thesis !Br u66| . Their result states 

Theorem 1.4. If (Y,Q, v, S) is a measure preserving system and if g € L 1 ^) 
and if ki,k2, ■ ■ ■ is a uniform sequence, then 



n 

)(y)=iun-J2g(S k *y) 

r>. 71 * * 



9L. 

n n 

1=1 

exists almost everywhere and g 6 L (y). 

A uniform sequence is a sequence ki , . . . of natural numbers such that 
there exists 

1. a strictly L-stable system (X, J 7 , n,T) 

2. a set A G J- such that n(A) > = /u((L4) where 8A is the boundary of A 

3. a point x G X such that fcj = ki(x,A) for each i > 1 (where ki{x, A) is as 
defined above). 

Definition 1.5. A measure preserving system (X, J 7 , /i,T) is a strictly Li- 
stable system if X is a compact metric space and T is a homeomorphism of 
X such that 

1. T n is an equicontinuous set of mappings, i.e. for any e > there exists a 
6 > such that for x,y € X, d(x,y) < 5 implies d(T n x,T n y) < e for any 
integer n and 

2. X possesses a dense orbit under T, i.e., there exists some x in X such that 
{T n x : n e 1} is dense in X. 

Example 1.6. Consider X = D the unit disk {z 6 C : \z\ = 1} under the 
standard Borel measure. This is a compact measure space and let T : D — > D 
be the irrational rotation defined by Tz = e 27ria z for some irrational a. Then T 
is a homeomorphism of D. For any e > and z%, A2 € D, if \z\ — < e then 

\T n Zl - T n z 2 \ = \e 2mna Zl - e 2ma z 2 \ = \z x - z 2 \ < e. 

Therefore the T n form an equicontinuous set of mappings. As a is irrational, D 
possesses a dense orbit under T for any z £ D. Therefore this gives us a strictly 
L-stable system on which we can build a uniform sequence. In fact, according 
to Brunei and Keane [BK69] every strictly L-stable system is homeomorphic to 
some ergodic rotation on a compact abelian group. 
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Therefore if our measure preserving system is a strictly L-stable system, then 
the sequence of return times A) which was discussed in Definition |1.1| 
is a uniform sequence. Rewriting Theorem |1.4| in terms of the characteristic 
function of our set A gives: 

Theorem 1.7. For any ergodic dynamical system (X, T, fJ,,T) with /i(X) < oo 
and A C X with /i(A) > 0. Then there exists a set Xf in X of full measure 
such that for any other measure preserving system (Y, Q, v, S) and any g € T x (y) 
then 

n 

lim-V X A(T"x) 5 (S" 2/ ) 
n n *■ — ' 
i— 1 

exists for v-a.e. y. 

The interesting fact to note here and in the more general statement by 
Brunei and Keane is that the convergence is in a sense "universal" because 
it does not depend on the choice of the dynamical system (Y, Q, v, S) nor the 
function g £ L (v). 

Krengel's book Ergodic Theorems |Kre85j highlights the generalization of 
the work of Brunei and Keane to the concept of universally good sequences. 

Definition 1.8. A strictly increasing sequence (i.e. a subsequence of N) 

is universally good with respect to a stated type of convergence (this could be 
norm convergence, pointwise convergence, etc.) if for any measure preserving 
system (Y, Q, u, S) and g € L r (v), then 

t=i 

converges. 

For example, the result of Brunei and Keane can be restated to say that 
uniform sequences are universally good with respect to a.e. convergence for 
functions in L 1 ^). The Blum and Hanson Theorem jBH60| shows that when S 
is a mixing endomorphism and (Y, Q, v) is a probability space, then every strictly 
increasing sequence fcj G N is universally good with respect to the L 2 (^)-norm 
for functions g in L 2 (u). 

On the opposite side, the question of when sequences fail to be universal has 
also been an object of intense study. Friedman and Ornstein |FQ72) use a tower 
construction to create a strongly mixing operator T for which the conditions in 
Blum and Hanson are not sufficient for guaranteeing universal a.e. convergence 
for functions in L 1 (/i). Krengcl [Krc71 was able to construct "universally bad 
sequences" for which regardless of the measure preserving system chosen one can 
find a function g in L 1 ^) for which the averages do not converge a.e.. Bellow 
|Bel89j created a sequence which was universally good for a.e. convergence for 
functions in LP when 1 < p < oo, but at the same time was universally bad for 
a.e. convergence in L q when 1 < q < p. This result hints at the concerns about 
breaking the duality which we will discuss later in Section [5] 
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The above "bad" examples have all been sequences of zero density. Along 
those lines, Friedman |Fri70j posed the question of whether or not it was true 
that sequences of positive density were universally good for a.e. convergence 
with respect to functions in Conze |Con73j was able to prove that when 

S is an automorphism with Lebesgue spectrum then sequences of positive lower 
density were universally good for a.e. convergence with respect to functions 
in but he was also able to construct a counterexample to Friedman's 

question for T without Lebesgue spectrum. 

1.2 Weighted Averages 

Returning to the concept of universally good sequences, we want to consider 
these sequences as "weights" on our average as follows. When n = ki for some 
ki in our sequence we add up g(S n y) (weighting it by 1) otherwise we weight 
it by and do not add the g(S"y) term. This concept was extended to the 
convergence of more general weights in Bellow and Losert E[L85 . They begin 
by defining a good weight. 

Definition 1.9. A sequence of complex numbers a = (a n ) is a good weight 

in L p {v) for an operator S on L p (v) if, for every g £ L p (v) 



exists z^-a.e. 

Note that if our operator on L v (v) is simply the one induced by our measure 
preserving system g — > g o S, then this average is the same type we have been 
discussing before and with the weights a n = 1 if n = ki and otherwise. Our 
ultimate goal is to find a weight for which convergence holds independently 
of the second dynamical system. Bellow and Losert define a good universal 
weight as a sequence which is a good weight for every operator induced by 
a measure preserving transformation on (Y, Q,v). That this is equivalent to 
being a good weight for every Dunford -Schwartz operator or every operator 
induced by an ergodic transformation is shown in their Theorem 1.4 which is 
due to Baxter and Olsen [B083J . Thus the earlier result of Brunei and Keane 
says that for a uniform sequence kj , then the sequence Oj = 1 if i = kj and 
otherwise is a good universal weight for a.e. convergence. 

The next area of study then is to try and determine what types of sequences 
make universal good weights for a.e. convergence. Section 3 of [BL85 shows that 
this result can be extended to show that a set of sequences T> strictly containing 
the bounded Besicovitch sequences (which include the uniform sequences of 
Brunei and Keane) are good universal weights for a.e. convergence. This is an 
extension of the Ryll-Nadzewski }RN75] work which shows that the bounded 
Besicovitch sequences are good universal weights for a.e. convergence. Then 
Bellow and Losert prove that for a strictly L-stable system, (X, T, n, T) and 




N-l 



3=0 
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any integrable /, the sequence a = {f(T n x)} is bounded Besicovitch for all 

x e X. 

Bellow and Losert extend the results of Blum and Reich BR77 and others 
to determine other characteristics of dynamical systems such as ergodicity, sat- 
uration or the spectral measure associated with the sequences which can lead 
to the sequence a = {f(T n x)} being classified as a good universal weights for 
a.e. convergence. 

In Theorem 5.4 of their paper, Bellow and Losert prove that if T has 
Lebesgue spectrum, then for every / £ L°°(n) there exists a set of full mea- 
sure Xf in X such that for every x G Xf, the sequence a = {f(T n x)} is a 
good universal weight for a.e. convergence. Bellow and Losert note that the 
result for K-automorphisms was previously shown using different methods by 

H. Furstenberg, M. Keane, J. P. Thouvenot and B. Weiss. 

As an extension of the work of Bellow and Losert, one can consider a dynami- 
cal system (X, F, fi, T) and function / and ask what properties of the dynamical 
system are required for the sequence a = (f(T n x)) to be /z-a.e. a good universal 
weight for a.e. convergence. The answer to this question is what is standardly 
referred to as Bourgain's Return Times Theorem. 

Theorem 1.10. Let (X, J 7 , fj,, T) be an ergodic dynamical system of finite mea- 
sure and f € L°°{ji). Then there exists a set Xf C X of full measure such 
that for any other ergodic dynamical system (Y, Q,v, S) with v{Y) < oo and any 
geL°°(v): 

1 N 

- ]T f(T n x)g(S n y) 

71 = 1 

converges v-a.e. for all x £ Xf. 

Let (X, T, /j,, T) and (Y, Q,v,S) be ergodic dynamical systems with n(X) < 
oo, v(Y) < oo, / € L°°(/Lt) and g £ L°°(v). Applying Birkhoff's theorem to the 
multiple term Cesaro average 

1 N 

n=l 

it is known that this average converges /i x v-&.e.. However, our focus is on 
showing that this convergence is "universal" and thus independent of the choice 
of dynamical system (Y, Q ,v : S) and function g which is a much stronger result. 

I. 3 Wiener- Wintner Results 

While the above developments were taking place in the realm of measure theory, 
some related results were being discovered in the field of harmonic analysis. 
In 1941, the publication of the Wiener- Wintner Theorem [WW41] gave 
a positive result for the Return Times Theorem when the second dynamical 
system is a rotation by a. 
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Theorem 1.11. If (X, fi,T) is a measure preserving system and if f £ 
L 1 (/x), then there exists a set X of full measure in X such that for x G X and 
for all 9 G K., the sequence 



1 N 
N ^ 

n=l 



f (T n x) 



converges. 



The proof by Wiener and Wintner relied on connecting the spectrum in 
the sense of Wiener with the point spectrum. Unfortunately, there was an er- 
ror in the proof of Wiener and Wintner. However the theorem is true and 
has been proven multiple times using a wide variety of techniques. The first 
correct version of the proof was given by H. Furstenberg Fur60j using the no- 
tions of joinings and generic points. We will see later the role of joinings and 
generic points in the proof of the Return Times Theorem. Another proof of the 
Wiener- Wintner Theorem which uses the Van der Corput's inequality |KN74| 
was given by Bourgain [Bo u90j in 1990. Actually Bourgain proved a stronger 
result referred to as Bourgain's Uniform Wiener- Wintner Ergodic The- 
orem which constitutes the proof that (1) implies (2) of the theorem below. 

Theorem 1.12. Given an ergodic dynamical system (X, J 7 , /x,T) and f G JC 
where JC is the Kronecker factor (the a -invariant algebra spanned by the eigen- 
functions ofT). The following statements are equivalent 



1. felC x . 

2. For [i-a.e. x, 



lim sup 

N a 



2-ninO 



o. 



Proof. Proof of (2) implies (1). We can decompose / into the sum /j + / 2 where 
fi G JC and f^ G JC . Using an orthonormal basis of eigenfunctions Cj each with 
eigenvalue e 27rl/3j we can write f\ as Y^fLi a j(f) e j where the convergence of the 
series is to be understood as being in T? norm. 

Assume that f\ ^ 0, thus a JO (/) ^ for at least one jo. Then we would 
have by the Spectral Theorem 



JV 

J ^Ea^k 



d/i < sup 



The left hand side can be written as 

N 



1 N 

-Y J h{T n x)e^ 



J j V n=l 



oo 1 N 

^ = EM/)i 2 ^E e2 ™ ( ^ o) 



By taking the limit with N we obtain a lower bound |aj (/)| 2 which is strictly 
positive if /i 7^ 0. □ 
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The Wiener- Wintner Theorem while discovered independently from these 
return times averages follows as a consequence of the Return Times Theorem. 
Thus any developments with regards to the Return Times Theorem have conse- 
quences for the Wiener- Winter result and thus may have other implications in 
the field of harmonic analysis. Similarly, developments with the Wiener- Wintner 
Theorem, may give ideas for expanding the Return Times Theorem. Our goal 
in this paper is to discuss the developments thus far and highlight some of the 
implications this may leave for future studies. 

2 Development 

In 1988, J. Bourgain Bou88 released a preprint of his proof of the Return Times 
Theorem. His statement of the Return Times Theorem is as follows: 

Theorem 2.1. Let (X , J 7 , fi, T) be a dynamical system with fi a finite positive 
measure and T and ergodic measure-preserving transformation. The generic 
return-time sequence is defined as A w = {n 6 Z + : T n uj <E A} for any set A 
of positive measure and any point uj in X. For almost all uj the sequence A u 
satisfies the pointwise ergodic theorem. Given any dynamical system (Y, Q, i>, S) 
where v is a finite measure, the averages 



converge v-almost surely for any g in L (v) where Ajv = A u n [1, N]. 

This important result impacted both the fields of ergodic theory and har- 
monic analysis and fully generalized the work with good universal weights which 
was presented in the work of Bellow and Losert |BL85j . This initial fifty-two 
page proof used difficult techniques from harmonic analysis which made it chal- 
lenging to analyze the proof, nevertheless to extend those techniques to proving 
other convergence questions. In 1989, J. Bourgain, H. Furstenberg, Y. Katznel- 
son and D. Ornstein |Bou89j published a more concise proof of the Return Times 
Theorem (found in the appendix to Bourgain's "Pointwise ergodic theorems for 
arithmetic sets" ) which utilized several key ergodic theory concepts such as the 
Rohlin Towers Lemma |DGS76j . One assumption that they make in their argu- 
ment is that the function / has finite range. A discussion of how to extend to 
a function / which does not have finite range along with a detailed analysis of 
their argument can be found in either |Ass03c| or |Col95j . 



2.1 The BFKO proof of Bourgain's Return Times Theo- 



One of the key techniques used in the proof is to study the set of points on 
which the theorem holds true. To do this we begin by looking at the set of 
points for which the Birkhoff averages converge. This concept was introduced 
by Furstenberg [Fur81 in the setting of regular measure-preserving systems. 




rem 
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Definition 2.2. A measure-preserving system (A, J 7 , fj,, T) is call regular if the 

space A is compact metrizable, the cr-algebra JF is Borel and the transformation 
T is continuous. 

Definition 2.3. Let (X, J 7 , n,T) be a regular measure-preserving system, xq £ 
X and /i € A4y(A) (the T-invariant measures on X). A point xq £ X is a 
generic point for /i if 



-^/(T"x )^ y fd,, 



N 

n=l 

for every continuous function / £ C(A). 

It is known that every separable measure preserving system is equivalent to 
a regular one (see |Fur81j ). thus for our purposes we use an extended definition 
of generic to the context of a general measure preserving system. In this case 
the gencricity is dependent upon the choice of the function /. 

Definition 2.4. Let (X,J-,fj,,T) be an ergodic measure preserving system. Let 
/ be an integrable function defined on A. A point xq £ X is generic for / if 

N 

fdfi 



N 

Thus Birkhoff's Pointwise Ergodic Theorem implies that for any / £ L 1 {^) 1 
H-a.e. x £ X is generic for /. 

The proof in [Bou89 begins by decomposing our function / with respect 
to the Kronecker factor K, in order to handle the two factors separately in the 
proof. When f £ JC the statement follows fairly easily from Theorem 
When / £ JC- 1 (and has finite range) the authors establish results which describe 
the set of full measure Ay on which the averages converge. This set is namely 



the intersection of the set of full measure on which Theorem 1.3 holds for / 
(because of the possibility that one chooses g = 1) and the sets 

Ai - jlim i f(T n x)J(T^y) = for M -a.e. y j 

and 

oo 

A2 = [J {x £ X : x is generic for xa ±T for any A C F n } 

n=l 

where F is the range of / and Ti : A — > F n is defined by 

T 1 (x) = (f(T 2 x),...J(T n+1 x)). 

As / £ JC it has continuous spectral measure and thus the set Ai has full 
measure. The other sets are of full measure by Theorem |1.3| 
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The authors then make the assumption that Theorem 1.10 does not hold on 
this set Xf. Thus there exists some other ergodic dynamical system (Y, Q,u,S) 
and g € L°°{v) such that the set 



B= lyeY 



lim sup 

N 



1 N 

-Y / f(T n x)g(S n y) 



n=l 



>o 



has positive measure. Without loss of generality this can be reduced to the 
situation where for some positive a a set 



Bi = < y € Y : lim sup i?e 

I fc JV>fc 



\ n=l 




(2.1) 



which has positive measure. Using the Rohlin Tower Lemma, given any 
6 > one can find an integer K and a set B 2 C B\ of positive measure such 
that the S(B 2 ), S 2 (B 2 ), . . ■ , S K (B 2 ) are pairwise disjoint and cover B 2 up to a 
set of measure less than | . Using the set Bi and the properties in the sets X\ 
and X 2 one can create a sequence of properly spaced ranges Rj — (Lj, Mj) on 
which the points behave poorly with respect to the return times average, but 
well with respect to the averages used in defining X\ and X 2 . Using these ranges 
and the set B one can create sequences {cn{y))n=\ (f° r some large enough N ) 
which are the sum of J layers (c 3 n (y))j =1 which have the following properties: 



1. For all j, n and y, the c J n (y) are uniformly bounded. 

2. Forj x ^i 2 , |iELi4 x (y)^(y) 

3- Re (±En=i ci(y)g(S n y) 



> - 

2 



< S and 
S, for j = 1, 



,J. 



This leads to a contradiction, thus the assumption that Theorem 1.10 does 
not hold on the set Xf is false. Using Ergodic Decomposition one can show that 
the theorem holds true if either T or S is a measure preserving transformation 
which is not ergodic. 



2.2 Extensions of the Return Times Theorem 

Definition 2.5. We will say that the return times theorem holds for the 
pair (L S ,L*) if for all / € L s (/j,) we can find a set of full measure Xf such 
that for each x £ Xf for all measure preserving systems (Y, Q,v,S) and for all 
g G L*{v) the averages 

1 N 

- ]T f(T n x)g(S n y) 

n=l 

converge z^-a.e. 
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As L°° is dense in L 1 , using the Banach Principle you can show that the 
return times theorem holds for the pair (L ,L°°) or (L 00 ,^ 1 ). Using Holder's 
Inequality one can show that the return times theorem holds for (L p , L q ) where 
- + - = 1. Using the Banach Principle we can show that this is true when 

| + | < 1. The case where | + | > 1 is explored below in Section pi 

Ornstein and Weiss [OW92j studied the set of points on which the Return 
Times Theorem holds in greater detail. They begin by rephrasing the Return 
Times Theorem in the following way. 

Theorem 2.6. If (X, J 7 , T, fj,) is an ergodic dynamical system and B G J- has 
positive measure, then for fi-a.e. x € X, the sequence {n6N: T n x G B} is a 
good universal sequence for the return times theorem, i.e. for any finite measure 
preserving system (Y, Q, v, S) and g G L 1 we have 

1 K f 

lim-^j(ry)= gdv 

k=i J 

for v-a.e. y G Y . 

The first extension that they present is a previously unpublished result of 
D. S. Ornstein, B. Weiss, H. Furstenberg, M. Keane and J. -P. Thouvenot. 

Theorem 2.7. If (X, J 7 , ji, T) has completely positive entropy, B G J- with 
positive measure and xq G X is generic for Xb, then the sequence n\ < ri2 < 
U3 < . . . of times of successive visits ofxg to B is a good sequence for the return 
times theorem. Thus for any finite measure preserving system (Y, Q,v,S) and 
g G L l (v) we have 

K 

l ™K^ f{Tnky)= / 9dv - 

k=l J 

As noted above in the creation of the set Xf in the BFKO proof of Theorem 
|1.10[ more was needed than the genericity of x to satisfy the Return Times 
Theorem. Thus one cannot extend Theorem 12.71 to a transformation which 
is merely ergodic using points which are merely generic with respect to the 
function \b- Ornstein and Weiss |OW92j describe the extra conditions on the 
point xq which are necessary in order for Theorem |2.7| to hold when T is an 
ergodic transformation. 

Definition 2.8. Let (X, J 7 , /i, T) be an ergodic measure-preserving transforma- 
tion and B a set of positive measure in T . A point Xo G X is self-sampling 
for xb if for u.-a.e. x G X 

1 N 

t^E^^^ 1 ) =M#) 2 - (2-2) 

i=l 

This condition is necessary for xq in order to create a good sequence for 
the Birkhoff ergodic theorem and the BFKO proof shows that it is a sufficient 
condition. To demonstrate the difference between genericity and self-sampling, 
Ornstein and Weiss present the following example. 



Survey of the Return Times Theorem 



13 



Example 2.9. Let (X, J 7 , T, /i) be a transformation with —1 in the spectrum 
so that there is a set B of measure \ with n{TB l~l B) = 0. A point Xo that 
visits B at the following times i 



• if (2n)\ < i < (2n + 1)! and i is even, 

• if (2n + 1)! < i < (2n + 2)! and i is odd, 



will be generic for \b, but for /z-a.e. x & X condition equation (2.2) will fail to 
hold. 

This example has some discrete spectrum. We may wish to consider dynam- 
ical systems which are restricted from having discrete spectrum. Consider the 
following definition. 

Definition 2.10. A measure preserving system (X, T, T, /j,) is weakly mixing 
if 1 is the only eigenvalue of T. That is to say that the Kronecker factor is 
reduced to the set of constant functions. 

Ornstein and Weiss posed the question of whether or not genericity is neces- 
sary and sufficient for Theorem |2.7| in the case of a weakly mixing dynamical 
system. This question is still open. The convergence of return times averages 
in case of weakly mixing dynamical systems will be discussed in greater detail 
in Subsection 12.31 

The rest of Ornstein and Weiss's paper |OW92j is devoted to extending the 
Return Times Theorem to a certain class of groups. It consists of those groups 
G which have a sequence of finite sets satisfying 

• A x C A 2 C ■ ■ • , and U%L x A n = G 

• for all g e G, lim„ \gA n AA n \/\A n \ = 

• there is a constant M such that for all n 

lA^Anl < M\A n \. 



That is to say G is an amenable group with F0lner sequence A n satisfying 

Bm|.A^ 1 A n |/|i4 n | < oo. 

n 

Tempelman [Tem67j has shown that this class of groups satisfy Birkhoff 's Point- 
wise Ergodic Theorem and thus they are a nice class to look at with respect to 
return times. 



2.3 Unique Ergodicity and the Return Times Theorem 

Connecting this to our earlier discussion, Bourgain's Return Times Theorem 
Theorem 



1.10 



proves that for any ergodic dynamical system (X, J 7 , /x, T) of 
finite measure and / e L°°(fi) for /x-a.e. x £ X the sequence a = f(T n x) is a 



Survey of the Return Times Theorem 



14 



universally good weight for a.e. convergence with functions from L°°. We would 
like to see if we can establish other properties for dynamical systems which will 
ensure that the sequence a = f(T n x) forms a universal good weight for a.e. 
convergence. We begin by looking at the concept of uniquely ergodic dynamical 
systems. 

Definition 2.11. A transformation T is uniquely ergodic, if there is only one 
measure T-invariant probability measure on X. For example, the irrational ro- 
tation discussed in Example 1 1 . 6| is an example of a uniquely ergodic dynamical 
system. 

The importance of uniquely ergodic systems is summarized by the following 
result of Jewett |Jew70| and Krieger |Kri72| . 

Theorem 2.12. Let (X, T, /jl) be an ergodic measure preserving system of the 
nonatomic Lebesgue probability space (X,J-,fi). There exists a uniquely ergodic 
standard system (Y, CJ, S 1 , !/) which is isomorphic to (X,J-,T,/i). 

In the case of uniquely ergodic operators there is a much stronger version of 



Theorem 1.3 found in |KB37j . 



Theorem 2.13. Let (X, T, fi) be a uniquely ergodic measure preserving sys- 
tem. Then for any f G C(X) 

fdfi 



uniformly in X . 



Therefore if T is a continuous map of the compact metric space X to itself, 
then T is uniquely ergodic (with unique ergodic measure /i) if and only if every 
point of X is generic for the measure \x. 

Two versions of Theorem for uniquely ergodic transformations were 



proven independently by I. Assani Ass93j IAss03cj and E. A. Robinson [Rob94 
and are stated below. Extensions of the results of Assani and Robinson can be 
found in Walters |Wal96j , Santos and Walkden |SW07j . Lenz |Len09bl lLen09a] 
and M. Schreiber [Schl2j . 

Theorem 2.14. [Ass93, AssOSc] Let (X, J 7 , T, /i) be a standard uniquely ergodic 
system. If f £ C(X) n then 



lim sup sup 

N x t 



1 N 

- £ f(T n x)e 2 « mt 

n=l 



= 0. 



Theorem 2.15. l r Rob94l Let (X, T, /i) be a standard uniquely ergodic system. 
Let Mt be the set of eigenvalues for T and let Ct be the subset of eigenvalues 
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with a corresponding continuous eigenfunction. If X £ Ct U (Mt) c then for 
every continuous function f the averages 



1 N 



converge uniformly in x. 



Using the ranges construction in the BFKO proof, E. Lesigne, C. Mauduit 
and B. Mossc LMM94 , describe a criteria for a sequence to form a universal 
good weight for a.e. convergence to for functions in L . Note the connection 
between these criteria and the self-sampling definition from |OW92] . 

Theorem 2.16. Part 1: Given a bounded sequence u n of complex numbers 
such that for all 8 > there exists some L§ > such that for all L > L$ there 
exists some Mg l > such that for all M > Ms l 




£ [0,M] :Vne [L S ,L], 



n— 1 



^rn+k^k 

k=0 



<8\>l-6. 



Then for any probability measure preserving system (Y,Q,u, S) and every f £ 



n— 1 

hm- V Mfc • ff (S fc y) = (2.3) 

n n ' — ' 

fc=0 

for v-a.e. y. 

Part 2: Given a probability measure preserving system (X, J 7 , T, fi) and a 
bounded measurable function f of Y which is in K. ( the orthocomplement of 
the Kronecker factor), then for almost every x £ X the point x is generic for 
the function f in the dynamical system and 



n— 1 

lira - V f(T k x) ■ f(T k x') = (2.4) 
n n — ' 



n 

k=Q 



for \x-a.e. x' . 

Finally, if a point x satisfies equation (2.4), then the sequence u n — f(T n x) 
satisfies equation (2.3). 

Using this criterion, given that unique ergodicity as seen in Theorem |2.13| 
gave us the BirkhofPs Pointwise Convergence Theorem (with uniform conver- 
gence) for all x £ X, we pose the following questions initially posed by Assam 
and Host respectively. 

Question 2.17. If (X, F, fi,T) is a uniquely crgodic probability system can we 
strengthen Bourgain's Return Times Theorem to give uniform convergence for 
all x £ X. That is to say, is a = f(T n x) a good universal weight for a.e. 
pointwise convergence for functions in C(X)7 
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Question 2.18. If (X, J 7 , fi, T) is a weakly mixing dynamical system and / € 
C(X). Is the sequence f(T n x) a good universal weight for the pointwise con- 
vergence in L 1 for each x € XI 

In Proposition 5.3 of [Ass03c , Assani gives a partial answer to these ques- 
tions. For this result we need the following definition. 

Definition 2.19. A weakly mixing measure preserving system (X, J 7 , T, /i) is 
said to have Lebesgue spectrum if the spectral measure of each function 
/ G L 2 (fi) with / fdfi = 0, <7f, is absolutely continuous with respect to Lebesgue 
measure. 

Theorem 2.20. JAssOScf Let (X, J 7 , T, /i) be a uniquely ergodic system which 
is weakly mixing with Lebesgue spectrum. Consider f G C(X). Then for all 
x G X, the sequence a = f(T n x) is a good universal weight for the a.e. pointwise 
convergence in L 1 . 



2.4 A Joinings Proof of the Return Times Theorem 

Around the same time as the above work was being done, D. Rudolph was 
working on a proof of Theorem |1.10| as well. His proof which was published in 
|Rud94| transfers the problem to the study of shift invariant measures defined 
on the space of sequences. He then uses joinings of these measures to reach the 
same conclusion. Rudolph acknowledges that his proof follows essentially the 
same path as the BFKO proof |Bou89j . using the characteristics of genericity 
and self-sampling to give a proof by contradiction. 

Rudolph begins his proof of Bourgain's Return Times Theorem (Theorem 



1.10) by determining the set of full measure in X on which the theorem should 
hold true. He calls the set G(f). The set G(f) is created using the definition 
of a product-null function and aspects of genericity with respect to /, / (that is 
the self-sampling property). This is akin to the construction of the set X\ and 
ultimately Xf in the BFKO proof. It is exactly the set of points which satisfies 



equation (2.4) in the Theorem ofLesigne, Mauduit and Mosse (Theorem 2.16) 



Definition 2.21. Suppose that (X, J 7 , fi,T) is a dynamical system and / e 
L°°(u) with J fd/u, = 0. A function / is product-null if for u. x u,-a.e. (x%, x 2 ), 

n— 1 

]im-y2f(T i (xi))7(T i (x 2 ))=0. 
n n * — ' 

The link between product-null functions, spectral measure and the Kronecker 
factor is outlined in the following lemma of Rudolph. Note that this lemma 
actually corrects the corresponding statement in the BFKO proof. 

Lemma 2.22. For f £ with J fd/j, = 0, the following are equivalent: 

1. f is product-null. 



2. The spectral measure of f is non-atomic. 
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3. f is orthogonal to all eigen] 'unctions ofT, i.e. f £ K. . 
4-. For [i-a.e. ergodic component fj, of fi x fj, 

f(x 1 )J(x 2 )dfl = 0. 

As in the BFKO proof, Rudolph reduces the problem using various tech- 
niques to a simpler setting. Using spectral decomposition, he assumes ergodic- 
ity. Define H as the subset in L p (/i) of functions / for which the theorem holds 
true. Using Holder's inequality and properties of limits, one can show that H 
is a closed subspace. In a proof akin to the BFKO argument, Rudolph shows 
that all of the eigenfunctions lie in H . 

What if H were not all of L p (ii)l Then there would be have to be a product 
null / which was not in H. This is deduced by taking a function which is not in 
H and using the decomposition of its measure into atomic and continuous parts 
to create a product null function which must also not be in H . This brings us 
back to the earlier classification of G(f) which was based off of the averages for 
product null functions. Thus the theorem has been reduced to looking only at 
product null functions and averages which converge to rather than converge 
to some general limit. 

Let H* be all of those functions g £ L x (y) for which the return times averages 
do not converge to v-a.e. for this /. Using a similar argument as for H, one 
can show this is a closed subspace of L x {y). Thus there is a product null / 
and a function g such that for some element x £ G(f) there is a set of positive 
measure B on which 



lim„ 



-. 71— 1 



i=0 



> a (2.5) 



for some a > 0. This is akin to the set B\ and a constructed in equation (2.1 1 
of the BFKO proof. 

The key technique of Rudolph's argument is found in his Lemma 10 where he 
transfers over the above assumption to the language of measures and joinings. 
This gives you a series of measures which create a contradiction with the product 
null characterization and genericity for these points. Because the function / is 
product-null, you have points that are /,/ generic for a measure where the 
integral of the points is 0, but our contradictive assumption will allow us to 



show that because of equation (2.5) this integral is bounded below by a. 

In studying the current status of the study of return times averages, we will 
look at the problem from three different angles: the multiterm case, character- 
istic factors, and breaking the duality. Each of these areas has made significant 
progress since the publishing of the proof of the Bourgain's Return Times The- 



orem (Theorem 1.10). For each topic, we will present a bit of the historical 
background, the current results and some open questions still under considera- 
tion. 
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3 The Multiterm Return Times Theorem 

In 1989, D. Rudolph visited the Department of Mathematics at the University 
of North Carolina at Chapel Hill while he was working on his joining proof of J. 
Bourgain's Return Times Theorem. Discussions of these topics continued as I. 
Assani visited Maryland in spring of 1990 and D. Rudolph returned to Chapel 
Hill for the entire spring of 1991. In their discussions, I. Assani demonstrated 
how one could extend the Return Times Theorem result to pairs of functions 
satisfying the Holderian duality and mentioned as a follow-up the following 
problems: 

1. The break of duality for the Return Times Theorem 

2. The Multiterm Return Times Theorem 

At a conference held at the University of North Carolina at Chapel Hill in the 
fall of 1991, I. Assani again mentioned these two open questions. Specifically, 
he raised the question, "If the Kronecker factor characterizes the functions for 
which the return times limit is not zero, then what could be the factor which 
would characterize the three term return times or more generally the h term 
return times theorem?" An answer to this question is D. Rudolph's Multiterm 
Return Times Theorem that we will discuss in this section^] The second question 
on the break of duality is discussed in Section [5j Note that evidence of the 
validity of such result was first announced for the weakly mixing case in |Ass98a] . 

When we say multiterm return times we are looking for /Xj-a.e. convergence 
in the same universal sense for averages of the form 

N H 
n— 1 i—1 

(universal sense meaning that the sets of full measure Xf i associated with each 
bounded function fi depend only on those j with 1 < j < i). 



As the Wiener- Wintner Theorem (Theorem 1.11) was a useful tool in the 



BFKO proof of the Return Times Theorem (Theorem 1.10), it was a logical 



first step to look at a multiterm version of the Wiener- Wintner Theorem. 

Theorem 3.1. Let (X, T ', /i, T) be a measure preserving system and f € L 2 (/i). 
For [i-a.e. x, for any measure preserving system (Y, Q, v, S) and any g € L 2 {^i) 
for v-a.e. y and for all the sequence 



i N ~! 

- J2 f{T n x)g{S n V y 



n=0 

converges. 

1 See Math Review MR1489899 (99c:28055) for more information on the historical devel- 
opment. 
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If (X, J 7 , /i,T) is weakly mixing, it was shown in Ass93| that Theorem 3.1 



follows as a simple consequence of Bourgain's Return Times Theorem (Theorem 



1.10). The general proof of Theorem 3.1 was obtained by E. Lesigne, D. 



Rudolph and I. Assani [ALR95 . Their argument uses disintegration of mea- 
sures. As such they reduce the proof to the case of a regular measure-preserving 
system. The u-algebra used to characterize convergence (in the same role as the 
Kronecker factor played above) was exactly the one created by J. -P. Conze and 
E. Lesigne |CL84( [CL88J while proving the convergence in L 1 norm of the mul- 
tiple recurrence averages for totally ergodic systems 

1 N 

^E/ ioT ^ 2oT2 "/3 oT3 "- 

n=l 

This result showed a link between the study of the multiterm return times 
and Furstenberg nonconventional ergodic averages. Note that, at the time, 
for a general measure preserving system the norm convergence of the averages 



in equation (3.1| was not established. The connection between characteristic 
factors and return-time phenomena will be discussed in more detail in Section 

m 

The convergence of the multiterm return times averages was obtained in 
1993 and sent for publication in early 1994 for all positive integers H and L 1 
i.i.d. random variables by I. Assani in [Ass97] . This was the first multiterm 
return time theorem obtained for all positive integers H. In 1998, D. Rudolph 
|Rud98| proved the Multiterm Return Times Theorem for bounded functions 
or functions which satisfy Holder's inequality, (L Pi ) where 1 < i < K and 

Theorem 3.2. Let k G Z + . For any dynamical system (Xq, J-o, To, (io) cind any 
fa € L°°(/i), there exists a set of full measure Xf in X$ such that if xq E Xf 
for any other dynamical system (Xt,.Fi,7i,^i) and any f\ £ L°°(n{) there 
exists a set of full measure Xt^ in X\ such that if x\ € Xf t then . . . for any 
other dynamical system (X k -i, Fk-i, Ifc_i, Mfc-l) an d an V fk-l € £°°(Mfc-l) 
there exists a set of full measure Xf k l in X^-i such that if Xk-i € ^f k -i f or 
any other dynamical system (Xk, Fki Tt, Hk) the average 

i N 

- h(T?x)f 2 (T 2 n x 2 )f 3 (T?x 3 ) ■ ■ ■ f k (T£x k ) 
n=l 

converges [i k -a.e.. 

Note that putting this in the context initially discussed, this result states 
that for any j < k, the random sequences f\(T[x{) f 2(^2X2) ■ ■ ■ fjiT^Xj) are 
universal good weights for a.e. convergence. 

One key feature of this argument is that the author proves his result without 
having to create higher-order cr-algebras. By converting the question at hand to 
the setting of measures on M N where M is a compact metrizable space, he avoids 
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the method of factor decomposition exploited above in the BFKO proof of the 
Bourgain's Return Times Theorem. The goal of Rudolph's paper is to construct 
an inductive argument to deduce the Multiterm Return Times Theorem from 
Bourgain's Return Times Theorem for two terms. If it were true that whenever 
a>i was a universal good weight then the sequence aif(T l x) was a universal good 
weight, then the induction would follow directly from Bourgain's Return Times 
Theorem. This is not the case as is demonstrated in the example presented in 
|Rud98j which is described below. 

Example 3.3. Construct a bounded sequence a, with the following two prop- 
erties: 

1. For any measure preserving system (Y, G, v, S) and g € L°°(v) 

oo 
i=l 

but 

2. there is a process (X, J 7 , fi,T) and / so that for /i-a .e. x the sequence 



dif(T l x) will look like the sequence in (Example 2.9) OW92 which was 
shown to be not a good universal weight. 

Therefore, Rudolph attacks the problem by looking at another property 
of sequences which he will define as fully generic from which the inductive 
argument can be made. His proof involves four major steps. First, the setting 
is transformed to the language of measures on a compact metrizable space and 
the definition of fully generic will be made. Secondly, it is shown that for any 
shift invariant measure, /i, /i-a.e. sequence is fully generic. Next, he proves the 
induction step through his Theorem 1 which states: 

Theorem 3.4. Suppose m G Mg is fully generic. For a second compact 
metrizable space Mi consider 

= {^-i "= Mi : {rn , mi) € (Af x Mi) N is fully generic} . 

This is a set of universal full measure in Mf. 

Lastly, to connect this argument with the return times it will be shown that 
fully generic sequences of complex values create universal good weights. His 
inductive argument is first shown for sequences which are generic for weakly- 
mixing measures and then lifted to an argument for general shift-invariant mea- 
sures. Rather than reproducing his argument here, we include some of the key 
terminology defined by Rudolph for transforming the return times argument to 
one about measures and joinings. 
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3.1 Definitions 

We will begin by discussing how and why we can transfer the statement of The- 
orem [3T2| to the context of measures on a compact metric space. In this case we 
have bounded functions fi, ■ ■ ■ , fk each defined on different measure preserving 
systems (Xi, J-"j,/Xj,Ti). Since the functions are bounded complex- valued func- 
tions, we can consider all of them as mapping onto some compact metrizable 
spaces Mi. For each fi we consider the map &i which maps from Xi to Mf by 
$i(x) = (fi(x), fi(Tix), fi(T?x), filTfx), . . .). Thus each fa induces a measure, 
we'll call /i* on Mf where H*{A) = /Lt i ($ _1 (A)). We can then transfer each one 
of the systems (X,-, J 7 ,-,^,^-) and functions /, to the system (Mf, A) , /«*, a) 
where a is the left shift <j(m) — a((mo, mi, m%, . . .)) = (mi, m?, m^, . . .). So 
now each system is now viewed on a compact metrizable space under the same 
transformation. 

If we study the concept of gencricity in Mf for arbitrary shift-invariant 
measures on Mf and bounded functions defined on Mf, we will be able to 
deduce Theorem 13.21 in the form in which we are interested. To see this 
consider the function it defined on Mf by 7r(m) = 7t((too, mi, 7712, . . .)) = rno, 
then studying the genericity of a point m = (mo, mi, 7712, . . .) with respect to 
the function ir means looking at averages of the form 

1 N 1 N 

n— 1 n—1 

So one specific choice for fh would lead to the averages 

1 N 

71=1 

Similarly studying the genericity of pairs of points (mo, mi) in some space (M x 
Mi) N leads to the very specific case of averages of the form 

1 N 

-^T/O^o)/!^!). 

ra=l 

Since Rudolph's characterization leads to an inductive argument he will be able 
to deduce the general result of Theorem |3.2| from the case of genericity over 
two terms as stated in Theorem |3.4| From this description it is clear that his 
result covers a much wider range than we are interested in for the Multiterm 
Return Times Theorem. We are just interested in the measures of the form /z* 
and points in Mf which come from mapping by <pi, but he proves his argument 
for general measures and points on M N . 

First, we define some key concepts in terms of measures on a compact metriz- 
able space. Let M be any compact metrizable space and m = (mo, mi, ra-i, . . .) 
be some arbitrary element of M N . We will let a represent the left shift transfor- 
mation as above. Define Ai(M) as the set of measures on M N . Then Ai s (M) 
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shall represent the shift-invariant measures on M N and j\4 e (M) the ergodic 
measures on M N for a which are the extreme points of A4 S (M). In this context, 
we express the genericity as follows. 



Definition 3.5. We say v G M(M) is generic for fi e M S (M) if 

N-l 

N 



1 N-l 



3=0 

A point to 6 M N is called generic for a measure /i 6 .M S (M) if the point 
mass at to, is generic for /i. 

Note that this is e quiva lent to the expressions for genericity discussed above 
(Definitions 2.3 and 2.4) because for any /z-measurable A C M N we have 



to is generic for ii is generic for fx 

1 N ^ 



N-l 

AT 
1 

iV 



#{jG [0,7V- 1] : tog <t-''(A)}->/*(A) 



a n ** 



N 

3=0 

Definition 3.6. For a measure v to be pointwise generic for a shift-invariant 
measure fi requires that for z/-a.e. to, the point to is generic for some ergodic 
measure fJ,(m) with the property 

^(fh)dv = /i. 



It follows from Theorem 1.3 that any shift-invariant measure is pointwise 
generic with respect to itself. This is because Birkhoff's Pointwise Ergodic 
Theorem implies that for any fi € A4 S (M), then for /i-a.e. to £ M N and any 
/x-measurable A c M N we have 



^ E XA{<J J {m)) -+E( X A\Inv) (to) 

and 

E (xa\ Inv) (m)dfj, = I XAdfi = n(A). 
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Definition 3.7. We say m is fully generic for u. e Ai e (M) if 5^ x /i N is 
pointwise generic for /x x /x N . A point rh £ M N is a fully generic point if it is 

generic for some measure /x € M e (M) and is fully generic for ti. 

Thus for 5^ x /i N -a.e. sequences of points (m , mi, m 2 , ■ • .), the sequence 
(mo, mi, m 2 , . . .) is generic for some ergodic measure /i((m , mi,m 2 , . . .)) with 
the property that 



/ 



/x((m ,TOi,TO 2 , • • -))d{6,n x /x N ) = /x x // 



So for /x N -a.e. (mi,m 2 , • • -)i the sequence (m, mi,ra 2 , . . .) is generic for some 
ergodic measure /x((mi,m 2 , . . .)) with the property that 

J /i((mi,m 2 , . . .))c^ N = M x A* N - 

For every /x x /immeasurable set A and /x N -a.e. (mi,m 2 , . . .) we have 

(MX^)^) = Jx^d^x^) 

= J J XAd^((rh l7 rn 2 , ■ ■ -))dfj, N 

= / lim x a x <7 x •• - ) j ((m,mi,m 2 , • • -)) d M N 

j JV-1 

lim ]v ^ Xa (^( m )V(™i)V(™2),-..)) rf M N 



J=0 



Let (j, £ M e (M). If v4 is a /i-measurable set, then the set A = Ax{}x {}x • • • 
is a it x /immeasurable set. If rh is fully generic for /x, then by the above analysis 

/x(A) = (/xx/x N )(A) 



AT-l 



J 3=0 
N-1 

J 3=0 
1 N ~ 1 



N 

3=0 



Thus m is generic for /x as well. 

Definition 3.8. A set i C M N is of universal full measure if for all /x € 

.M S (M), /x(A) = 1. It is enough to check this just for the ergodic measures. 
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The companion statement of Theorem 1.3 in this new context is given 
below. 

Lemma 3.9. The set of points fh £ M N that are generic for an ergodic measure 
fi is a set of universal full measure. That is to say, for any \x £ M e (M) 

/i({m : fh is generic for fi}) = 1. 

Using the above definitions, we wish to establish the following result con- 
cerning the set of fully generic points. 

Theorem 3.10. Let A = {fh £ M N : rh is a fully generic point }. Then the 
set A has universal full measure. 

Let /i £ A4 e (M). Let A^ = {fh £ M N : rh is a fully generic point for /z}. 
Then 

p.eM c (M) 

To show that A is of universal full measure, I need only show that /J,(A^) = 1 for 
all /i £ M. e (M). As /i x f/ 1 is a shift-invariant measure, it is pointwise generic 
with respect to itself. Thus the set of all points in rh = (fh ,fh ll fh2, . ■ .) in 
M N x (M N ) N which are generic for some ergodic measu re i^( fh) has full measure 
with respect to /i x /i N . Call this set B^. By Lemma 3.9 the set of all points 



mo such that rho is generic for [i has full measure with respect to /i. Call this 

x M N x M N x M N x • • • . Then B^. 

3^2- Then B^ 

respect to /i x /i N . If (fh, rh\, rhz, . . .) € B ma , then m G and thus ^{A^) = 1. 



set G M . Define 5^,2 = G M x M N x M N x M N x ■ ■ ■ . Then S Mi2 has full measure 
with respect to fix fi^. Let B^ — B mUi \ fl Su,2' Then has full measure with 



4 Characteristic Factors 

As you can see in the BFKO proof of the Return Times Theorem one of the 
keys to the argument was to break up the function using the Kronecker factor 
in order to prove the result independently for both the eigenfunctions and those 
functions in the orthocomplement of the Kronecker factor. Using factors in 
proving convergence in ergodic theory has long been a very useful tool. The no- 
tion of a characteristic factor is originally due to H. Furstenberg and is explicitly 
defined in |FW96j . 

Definition 4.1. When the limiting behavior of a non-conventional ergodic av- 
erage for (A, (j,, T) can be reduced to that of a factor system (Y, Q, v, T), we 
shall say that the latter is a characteristic factor of the former. 

For each type of average under consideration, one will have to specify what 
is meant by reduced in the given case. One approach to studying the con- 
vergence of nonconventional averages is to find the minimal characteristic 
factor which is the smallest factor which is characteristic for a given type of 
recurrence. In the case of Furstenberg and Weiss [FW96 , they define the notion 
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of characteristic factor when finding a characteristic factor for averages of the 
type 

1 N 

n=l 

Therefore their specific definition of characteristic factor is as follows. 

Definition 4.2. If {pi(n),ft(i), • ■ • ,pk(n)} are k integer-valued sequences, and 
(Y, Q, v, T) is a factor of a system (X, J 7 , /i, T), we say that Q is a characteristic 
factor for the scheme {p 1 (n),p 2 (n), . . . ,p k (n)}, if for any /i, / 2 , • ■ ■ , /fc € 

L°°(ii) we have 



- £(/ioT*< B >) ■ ■ • (f k oT*»W)~ ^(E(/ 1 |0)o^(«)) • • • (E(/ fc ,0oT»*< n >) 

n— 1 n— 1 

converges to in L 2 (fi). 

4.1 Characteristic Factors and the Return Times Theo- 
rem 

The proof by D. Rudolph of the Multiterm Return Times Theorem (Theorem 



3.2 1 gives an elegant proof of the theorem, but avoids the creation of the null 
set off of which the averages converge and the discovery of the factor which 
was characteristic for the return times averages. Both the null set and the 
characteristic factor played a key role in the proofs of Theorem |1.10| for two 
terms in |Bou89j and |Rud94) . thus there is some interest in identifying what 
factors are characteristic for the Multiterm Return Times Theorem (Theorem 



3.21 



In 2012, I. Assani and K. Presser published an update |AP12j of their earlier 
unpublished work |AP03j on characteristic factors and the Multiterm Return 
Times Theorem. In a reversal of the BFKO argument which used the properties 
of the characteristic factor to help prove the Return Times Theorem, Assani and 
Presser use the convergence of the multiterm return times averages guaranteed 
by Rudolph's Multiterm Return Times Theorem [Rud98j to demonstrate the 
convergence properties for two types of factors. 

We first consider the factors used by H. Furstenberg to prove Szemeredi's 
Theorem |Fur77j . These factors are called k step distal factors in |Fur77j . We 
denote these factors as Ak using the notation from Ass03a where these factors 
were shown to be L 2 -characteristic for the averages 



n— 1 i 



Definition 4.3. Let (X, J 7 , fi,T) be an ergodic dynamical system on a prob- 
ability measure space. The factors Ak arc defined in the following inductive 
way. 
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• The factor Aq is equal to the trivial <7-algebra {X, 0} 

• For k > the factor Ak+i is characterized by the following. A function 
/ € A k+1 if and only if 



1 H 

N k+1 (.f) 4 := Hm - H E (/ ' / ° rh l^*)|| 2 = 

h=l 



In the aforementioned paper, we prove that these seminorms are well-defined 
and characterize factors which are successive maximal isometric extensions. 

Theorem 4.4. Let k be any positive integer. For any ergodic dynamical system 
{X, J 7 , fx,T) and for each f € L°°(fi) we can find a set of full measure Xf such 
that for each x £ Xf, for any other dynamical system (Y 1; Gi, Si, Vi) and any 
gi 6 L°°(yi) with ||<7i||oo ^ 1; there exists a set of full measure Y gi such that for 
each j/i in Y gi then . . . for any other dynamical system (Yfc_i, Gk-i, Sk-i, v k—i) 
and any gk-i € L°°{yk-i) with ||fiffe || oo < 1 there exist a set of full measure Y gk _ 1 
in Yk-\ such that ifyk-i € Y gk l for any other dynamical system (Yfc, Gk, Sfc, Vk) 
for Vk-a.e. y k 

• the average 

1 N 

- if( Tnx ) - nf\Ak){T n x)] 9l {S n iyi ) g2 (Sqy 2 ) ■ ■ ■ gk{S n k yk) (4.1) 
n=i 

converges to 0. 

• Thus for f 6 A^ the average 

N 

V 



1 N 

- J2 f(T n x) gi (S? yi )g 2 (SZy 2 ) ■ ■ ■ g k {S n k y k ) 



converges to Vk-a.e.. 

• Also we have the following pointwise upper bound for our limit 

i 

<CN k+1 (ff 



lim sup 



1 N 

- J2 f{T n x) 9l {S^y)g 2 {S^ V2 ) ■ ■ ■ g k {Sly k ) 



n=l 



(4.2) 



An important aspect of the equation (4.2) is the control of the orbits made 
by the seminorms N k +i- 

The study of the nonconventional Furstenberg averages has seen important 
progress being made in the last seven years. In [HK05] and |Zie02| the Host- 
Kra-Ziegler factors Zk were created independently by B. Host, B. Kra and T. 
Ziegler and were shown to be characteristic in L 2 norm for the Furstenberg 
averages. 
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Definition 4.5. Let (X, J 7 , n, T) be an ergodic dynamical system on a proba- 
bility measure space. The factors Zk are denned as follows. 



• The factor Zq is equal to the trivial a-algebra. 

• The factor Z\ can be characterized by the seminorms 



h=i J 



where 



• The factor Zi is the Conze-Lesigne factor, CL. Functions in this factor 
are characterized by the seminorm ||| ■ ||| 3 such that 



h=l 



0. 



A function / e CC 1 - if and only |||/||| 3 

• More generally B. Host and B. Kra showed in [HK05J that for each positive 
integer k we have 



2 fc+i 



2 

I He ) 



(4.3) 



h = l 



with the condition that / € Z k L _ 1 if and only if 



0. 



Theorem 4.6. Let [X, J 7 , fx, T) be an ergodic measure preserving system. The 
Host- Kra- Ziegler factors Zk are pointwise characteristic for the multiterm return 
times averages. 

As the Zk factors are smaller than the factors Ak, and thus Ah C Z^-, the 
fact that the Ak factors are pointwise characteristic for the multiterm return 
times averages is a consequence of Theorem |4.6| But in our proof of Theorem 
|4.4| using the seminorm defining the factors Ak we obtain pointwise uniform 
upper bounds of the multiterm return times averages. With the Zk factors we 
do not have such pointwise estimates. The uniform upper bounds are derived 
after integration combined with a limsup argument See [API 2) and Example 
5.1 in T. Eisner and P. Zorin-Kranich [EZK12] . 



5 Breaking the Duality 



As mentioned above the original Return Times Theorem (Theorem 1.10) can 
be easily extended to L p (fi) and L q (v) where - + ~ < 1 using the Banach 
Principle and Holder's inequality. What can be said about convergence of return 
times averages when 1 + 1 > 1? 

A first motivation for this question comes from the following result due to 
I. Assani (1990, see page 141 in [Ass98b| ). 
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Theorem 5.1. Let (X, J 7 , /x, T) be a measure preserving system and f € L {/j,) 
then for fi-a.e. x, for each measure preserving system (Y,Q,i/,S) and every 
g G L l (y) the averages 

N 

X 

converge in L (y) norm. 

The proof is a consequence of the Wiener- Wintner Ergodic Theorem (Theorem 



i " 



1.11), the Spectral Theorem and the Maximal Ergodic Theorem |Ass03c| . In 
view of this result one could look for the true nature of the dynamic involved 
in the return times theorem allowing to go beyond the Holderian duality. 

A first evidence of the possible validity of the return times theorem beyond 
the Holderian duality was shown in |Ass97| and |Ass98bj . In [Xss97j, I. Assani 
showed that if (X, F 1 /j,, T) is a measure preserving system and / € L p , 1 < p < 
oo, then we can find a set Xf of full measure such that for every sequence of 
independent, identically distributed (i.i.d) random variables X n in L 1 we have 

lim /(T^)X„ M=o 

n n 

for a.e. uj. T his leaves the case where p = 1 which is addressed below in Sub- 
section 5.2 on the (L 1 ,^ 1 ) case. Then in Ass98bJ, I. Assani showed that a 
sequence {X n } of i.i.d. random variables defined on the probability space ft and 
having a finite p-th moment for some 1 < p < oo are universal good weights for 
a.e. convergence of the return times averages on L q where 1 < q < oo. That 
is to say that the return times holds for the pair (LP,L q ) when 1 < p, q < oo 
when we restrict the initial term to i.i.d. random variables. 

Subsequent work by C. Demeter in |Dem04 looked at breaking the duality 
in weighted ergodic averages of the form 



1 N 

-Y,a{k)f{T k x) 



n=l 

where a(k) is a sequence of complex numbers and T is a linear operator of some 
LP space with 1 < p < oo and f E L p . 

Further progress along these lines can be seen in Theorem 1.6 of C. Demeter, 
M. Lacey, T. Tao and C. Thiele |DLTT08j . see also |Ass97j and |BJLO04j . which 
is stated below. 

Theorem 5.2. Assume that either p > 1 and q — 1, or p — 1 and q > 1. For 

each dynamical system (X, J 7 , fj,, T) and each f € L p (fi) there is a set X* C X 
of full measure, such that for each sequence of LP i.i.d. random variables Y n 
defined on the probability space (Y,Q,v) and each x € X* , 

1 N 

n=l 

exists for v-a.e y. 
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Definition 5.3. A sequence of complex numbers {a(k)} is p-Besicovitch if 
{a(k)} is in the closure of the trigonometric polynomials in the semi-norm 

1 N 

limsup — 22\a{k)\ 
N k=l 

In |LOT99] . M. Lin, J. Olsen and A. Templeman showed that if a(k) is q- 
Besicovitch and / £ LP where - + i = 1 then the above averages converge for all 
/ £ LP when T is a Dunford-Schwartz operator. J. Baxter, R. Jones, M. Lin and 
J. Olsen [BJLO04] present a construction which demonstrates that the duality 
is necessary in the case that q — 1. They show that if {X n } is a nonnegative 
i.i.d. sequence with J5(|Xi|) < oo and E(Xi) = which is not essentially 
bounded then there exists a sequence {a(n)} of nonnegative numbers such that 
the sequence of its arithmetic means converges to zero and the sequence of 
weighted averages of the X n taken with the weights a(n) is as bad as possible, i.e. 
the liminf heads to while the limsup heads to oo (see also |CL09llLW07| ). In 
|Dem04) . C. Demeter showed that the duality is necessary for the g-Besicovitch 
sequences to make universal good weights when q > 1 as well. This result 
of Demeter is simplified in the paper [DJ04] where C. Demeter and R. Jones 
describe a possible approach to dealing with duality with respect to the return 
times. 




5.1 Hilbert Transforms 



While the Return Times Theorem (Theorem 1.10 1 looks at the convergence 
of weighted Cesaro averages, these averages are closely related to the discrete 
ergodic Hilbert transform 

n ^— ' k 

k=—n 

which is a tool of Harmonic Analysis which was first studied by M. Cotlar 
|Cot55| . see also A. P. Calderon |Cal68| . 

The connection between the convergence of the Cesaro averages and the 
existence of the ergodic Hilbert transform has been established by R. Jajte 
|Jaj87| for L? functions. For the one-sided ergodic Hilbert transform a simple 
partial summation argument shows that the convergence of the series XmLi if 
implies the convergence of the averages jj 2n=i a n- 

Conjecture 5.4. Given any dynamical system (X, J 7 , /x,T) and / £ L p (fi), there 
exists a set of full measure Xf £ X such that for all x £ Xf and for every other 
dynamical system (Y, Q, v, S) and g £ L q (v), the limit 

lim f fP**MS>y) 

n — ^ h 



exists for i/-a.e. y. 
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As referenced by [Tal95] . Lacey and Marcus have shown that the conjecture 
is false for p — 1 using the family of rotations on the torus as the initial weight 
and applying that to g(y) — e 27 " y . They actually proved the existence of an L 1 
sequence {ATj,} of i.i.d. random variables such that for almost every x 



hm V ^f" M 

k=—n 



fails to exists for some 9. Similarly, Talagrand showed the same negative result 
holds for the one-sided version of the series in the course of proving the following 
theorem. 

Theorem 5.5. Given an i.i.d. symmetric sequence of random variables {Xk}, 
the random Fourier series 



^ke 



^ k 

k=l 

converges uniformly for a.e. 9 if and only if {Xk} is in L log log L. 

However, Cuzick and Lai |CL80j proved that for each LP sequence {Xk} of 
i.i.d. random variables where p > 1 there exists a set X* € X with the property 
that 

Um ^ Xk(x)e 

k=—n 

exists for each x g X* and G T. 

To study duality and the weighted convergence with respect to the Hilbert 
transform, it has become a customary strategy following in the models as above 
to look at the role that i.i.d. sequences play in the convergence of such averages. 
In |Ass04a| . the first author proved the following about duality and the Hilbert 
transform. 

Theorem 5.6. Let (X, J-, /i, T) be a dynamical system and f € There is 

a set Xf G X of full measure such that for each sequence {Yfc} of i.i.d. random 
variables defined on the probability space [Y,Q,y), with Y± € L q (X),q > 1 and 
each x € Xf 

f(T k x)Y k (y) 



lim > 



k ^ 

K 

k— — n 



exists for v-a.e. y. The result fails when q = 1. for some f in every ergodic 
dynamical system [X, J 7 , /!, T) . 

A companion result was proven in |Ass04a| this time with the i.i.d. sequence 
playing the role of the weight in the average. 

Theorem 5.7. Let {Xk} be a mean sequence of i.i.d. random variables defined 
on the probability space (X, J- ', p) which are assumed to be in LP for some p with 
1 < p < oo. Then there exists a subset X* € X of full measure such that for 



Survey of the Return Times Theorem 



31 



each x G X*, the following holds: for any dynamical system (Y,Q,v,S) and 
g G L r with 1 < r < oo 

n. l- — 1 h 



k——n 



exists for v- a. c. y. 

Cuny Cun05j showed that this last result could be extended to the case 
where X\ G LlogL and g G L\ogL. Demeter [Dem06] showed that for the 
series 

Um ^ X k{x) Y k{ y) 

n ^ k 

k— — n 

one can obtain a.e. convergence if both X n and Y n are i.i.d. sequences in L 1 . 

In [DLTT08] . C. Demeter, M. Lacey, T. Tao and C. Thiele showed that 
Conjecture |5.4| is true if 1 < p < oo and q > 2. One of their main results is 
the following theorem. 



Theorem 5.8. Let K 

ments: 



— > K be an L 2 -kernel satisfying the following require- 



K gC°°(]R\{0}) 
1 



\K(0\<minil, 



< i . 

> mm 
1^1™ 



1^1- 



,Vf^0,n>l. 



(5.1) 
(5.2) 

(5.3) 



Then the following inequality holds for each 1 < p < oo. There exists a finite 



constant C p such that 









sup 


sup 


2*/ 


ll9llt2(E) =1 


feez 



^ y /(^ + 2/).9(^ + y)^(J)rfy 



L 2 ( 



< Cp||/lkf(R) 



where 



\\F\\ Ll (R) 



\F{z)\ 2 dz 



1/2 



Later in their paper (Corollary 3.8), they transfer this result to the ergodic 
setting and show that the associated return times theorem holds when 1 < p < 

00 and q > 2. This breaks the duality but leaves ope n the remaining cases where 

1 + 1 < 2. In particular, if one looks at Theorem 5.2 one could reasonably 
ask: 

Question 5.9. Does the Return Times theorem hold for p = 1 and q > 1 or 
p > 1 and q = 1? 
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Because of the failure of the convergence of the series 



oc 



f{T n x)g{S n y) 



E 



n 



n——oo 



for p = 1 and q — oo (by the Lacey, Marcus result (referenced in |Tal95j ) one can 
not expect the use of the Hilbert transform series to answer the above question. 
One can observe that while the Cesaro averages of nonncgative functions is 
nonncgative, the Hilbert transform does not have this property. In |Dem09] . 
however, it is announced that the range of validity of the return times could be 
extended to \ + ± < 3/2. See also jOST+j . 

A second corollary of Theorem |5.8| is Theorem 3.4 of the Demeter, Lacey, 
Tao and Thiele paper which only shows that the set of convergence is closed. 
Having no obvious dense set the authors refine their techniques to prove a vari- 
ational inequality which establishes the pointwise convergence of the ergodic 
Hilbert transform for the specific range of p and q. 

The method used in [DLTT08] adapts the tools developed by M. Lacey and 

C. Thiele in their understanding and applications of the celebrated Carleson- 
Hunt theorem on the convergence of the Fourier series of LP functions and the 
bilinear Hilbert transform and Calderon conjecture. The upper bound of 3/2 
appears in their papers. 

One wonders if one needs to use such powerful tools to solve the problem of 
the break of duality. In other words is the difficulty of breaking the duality in the 
return times theorem at the level of the Carleson-Hunt theorem? This seems to 
be the case as indicated by T. Tao in one of his blogsj^] As such, it would appear 
that the problem of the break of duality for the return times theorem could shed 
new light on the Carleson-Hunt theorem and produce substantial refinements. 
A first step in this direction is a variational Carleson Hunt theorem obtained in 
|OST + | . Indeed, based on the Multiple Return Times Theorem established by 

D. Rudolph one can ask the following question. 

Question 5.10. What is the range of values (j>i , J>2 > £*3 > ■■■,Ph) for which the H 
term return times theorem holds? 



Does the return times theorem hold for the pair [L x ,L l )l In 1990, Assani 
[Ass03bj proved that for a finite measure preserving system (X, J-, [i, T) and 
/ G L}(n) then for /i-a.e. x G X the sequence {f(T n x)} is a good universal 
weight for the norm convergence in L . 

This initial result gives some support for the possibility that the return times 
theorem would hold for the pair (L 1 , L 1 ). To approach the question of the return 
time for [L , L 1 ), it was suggested in |Ass97| to look at the return times for the 
tail. 



5.2 The (L 



\L 1 ) Case 



2 http: //terrytao.wordpress. com/2007 / 12/ 1 1/ 
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Definition 5.11. The return times property holds for the tail for the 
pair (X s , L*) if for all / G L s (fi) we can find a set of full measure Xf such 
that for each x G Xf for all measure preserving systems (Y, Q,v,S) and for all 
g G L l {v) the sequence 

/(r"a;) g (5"y) 
n 

converges v-&.e. to 0. 

In |Ass97j . the first author showed that the validity of the return times for 
the tail for the pair [L , L 1 ) is equivalent to the following counting problem. 

Conjecture 5.12. For any measure preserving system (X,J-,fj,,T) and for all 
/ G L 1 ^) and /z-a.e. x 

# ( fceN: Mf£)>l| 
N*(f)(x) = sup — < oo 

n fl 

In [Ass98| . I. Assani showed that N*(f) G L 1 if / G L log L. In |DQ04| , C. 
Demeter and A. Quas showed that N*(f)(x) < oo a.e. when / G L log log L. 
However, in the papers [ABM05J, this conjecture is proven false and its connec- 
tion to return times is discussed in greater detail. The key result is Theorem 1 
of |ABM05j which states 

Theorem 5.13. In any nonatomic, invertible ergodic system (X, J 7 , /x, T) there 
exists f £ L\ such that 

sup = oo 

n n 

almost everywhere where 



k 



n 



The work in ABM05 demonstrates that with the above result the return 
times for the tail does not hold for the pair (L 1 ,^ 1 ) and thus the return times 
theorem itself does not hold for the pair (L 1 ,^ 1 ). As noted in |DQ04| , the 



method used in ABM05 to prove Theorem|5.13 shows in fact that for / in any 



Orlicz space strictly bigger than L log log log L we can still have N*(f)(x) = oo 
a.e. 

Question 5.14. Is N*(f)(x) < oo a.e. for / G LlogloglogL? 

A related question was raised in |CT12| by M. J Carro and P. Tradacete 
for the following related operator A introduced by I. Assani (see |ABM05j ) and 
defined pointwise as 

A(f)(x) =sup\-m(o<y<x: ^ X ~ ^ > A 

A I V 

In Section 5.1 of ABM04 we see some applications for this negative result 
for the Return Times in L . Assume that (£l,B,P) is a probability measure 
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space and Yi, Y2,. .. is a sequence of i.i.d. random variables of values in —1,1 
with P(Y n = 1) = a and P(Y n = -1) = 1 - <j where \ < a < 1. Set 

By the strong law of large numbers we know that 

lim^M= J B(y 1 ) = 2<7-l. 
n n 

If <t > 2 the for /x-a.e. cj we have lim n a„(u;) = 00. Fix such an w and let 
/ € L p (fi) where 1 < p < 00. It was proven in [LPWR94 that if p > 1 and 
<t > i , then the averages 

1 W 

n=l 

converge for /i-a.e. X. Using Theorem |5.13| we have the following result. 

Theorem 5.15. Consider a sequence of i.i.d. random variables Y\,Y2,ldots 
defined on a probability measure space (£l,B,P) of values —1, 1. Assume that 
P(Y n = 1) = a and P(Y n = —1) = 1 — a with a > \. Set a n (u>) = Y^k=i 
and fix uj such that lim„a„(o;) = 00. In any aperiodic dynamical system there 
exists a function f £ L 1 (/^) such that the averages 



1 N 

-J2f(T a ^x) 



n=l 



do not converge almost everywhere. 



6 Other Notes on the Return Times Theorem 
6.1 The Sigma-Finite Case 



As Birkhoff's Pointwise Ergodic Theorem (Theorem 1.3) actually holds for 



cr-finitc measure spaces, one question to consider is whether the Return Times 



Theorem (Theorem 1.10) can be extended to cr-finite measure spaces as well. 



This question was address by Assani in [As s99| in the following theorem. 

Theorem 6.1. Let (X,J-,fi.T) be a measure preserving system on the a -finite 
measure space (X, J 7 , [/,) . Given a set A with finite measure, then the sequence 
{XA(T n x)} is [i-a.e., a good universal weight for the pointwise ergodic theorem 
in L . 



The proof utilizes the method of Hopf 's decomposition )Kre85j to extend the 
BFKO result to a cr-finite measure space. 
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One cannot extend the Return Times Theorem (Theorem 1.10) to a more 
general situation with an infinite measure space. In Lacey |Lac97) the following 
theorem is used to show that there exists a sigma-finite measure preserving 
system (X, J 7 , fi, T) and a set A C X of positive finite measure so that for almost 
every x £ X and for every aperiodic measure preserving system (Y, Q,v, S) with 
v(S) = 1, there is a g £ L 2 {y) so that the averages 



r- 1 (x)J2xA(T m x)g(S m y) 

m—1 

diverge for z/-a.e. y where 

n 

Tn (x) = V XA(T n x), 



m—1 



Theorem 6.2. Let X m be non-negative i.i.d. integer-valued random variables 
such that P{X\ > A) ~ A~ Q as A — > +oo. Here < a < \, so that EX\ = +oo. 
Then with probability 1, for every aperiodic finite measure-preserving system 
(Y, Q, v, S) there is a square-integrable function gonY for which 

1 N 

A n (g)(y) = -J29(S T "y) 

n=l 

diverges for v-a.e. y, where the power of S above is r„ = Y) _, X m . 

This construction builds off of a creation of a simple random walk whose 
returns of the walk to the origin is almost surely a bad sequence along which to 
try the pointwise ergodic theorem. 

6.2 Recent Extensions 

In [HK 09 , B. Host and B. Kra have extended the Wiener- Wintner Theorem 



(Theorem 1.11) by showing the following. 



Theorem 6.3. Let {X, /i, T) be an ergodic system and f £ L 1 ^) Then there 
is a set of full measure Xf such that for every x £ Xf the averages 

1 N 

- J2 *nf(T n x) 

71=1 

converge for every nilsequence (a n ). 



B. Host and B. Kra. used Theorem 6.3 to also prove that the sequence 
(f(T n x)) is a.e. a good weight for the convergence in L 2 -norm of the averages 



N K 



N ^ 

n=l k=l 
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extending their result in citeHK-NEA where gt G L°° . The idea of mixing 
weights from a.e. multiple recurrence and the multiple return time theorem was 
introduced in [Ass00 j for weakly mixing systems. 

Note that a sequence (a n ) G l°° is a basic Z-step nilsequence if there exists a 
basic Z-step nilsystem (G/T, S), a point y G G/T, and a function F G C(G/T) 
such that a„ = F(S n y) for all n G N. In |EZK12j . a new proof of the result 
of Host and Kra is given by T. Eisner and P. Zorin-Kranich and extended to 
F0mer sequences. 



6.3 Wiener- Wintner Dynamical Functions 

Definition 6.4. Given an ergodic dynamical system (A, J 7 , /i,T) a function 
/ G is said to be a Wiener- Wintner function of power type a if we 

can find a constant C such that 



sup 
t 



1 N 

-£/(T«x)e 



2-xint 



< 



c 



for each A. 



Ergodic systems having a dense set of Wiener- Wintner functions in JC 1 - are 
called Wiener- Wintner dynamical systems (see |Ass03b] ). Such systems allow 
one to give simple proofs of the return times theorem, the a.e double recurrence 
[Bou90] , and the convergence of the one sided ergodic Hilbert transforms 



E 



f{T n x)g{S n y) 



and 



E 



f(T- n x)g(S- n y) 



For example, let us look at a proof for why the one sided Hilbert transform 
converges in the setting of Wiener- Wintner functions. 

Proof. Take / a Wiener- Wintner function of power type a > and (3 > such 
that /3a > 1. We have 



E 

JV=1 



sup 

t 



A" 9 



2-nint 



AT/? 



< 



c E 



< OO. 



We can fix x G Ay, a set of full measure, such that 



E^p 



AT=1 



A/ 3 



< oo. 
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This set is independent of the dynamical system (Y, Q,v). By the spectral the- 
orem we have for each positive integer N 



N 

/ jj'£f(T n x)g(S n y) 

J 71=1 



dv < Csup 
e 



1 N 

- f(T n x)e 2 ™ 8 



n=l 



Therefore for x £ Xf we derive the following 



E 

JV=1 



1 IV P 



AT/3 



This implies that for u-a.e y 



n=l 



N 



< 00. 



l ^Y J f{T n x)g{S n y) 



AT/3 



n=l 



converges to zero. 

For the general sequence we consider for each integer M the unique TV integer 
such that < M < (N + l) 13 . Then we can write 

M p N? M 

-Y J f{T n x)g{S n y) = — w Y,^ Tnx )^ Sn y) + M E f^'W^y). 

n=l 71=1 n=NI 3 

The last term goes to zero as it is dominated in absolute value by 

(N+l)* 3 

u E \f^\ 



M 



(in the case where g G which is equal to 



N f> 



- x: i/(T-*)i--]Ti/(T«*)i. 



M 



n=l 



And this last quantity goes to zero by the pointwise ergodic theorem for the 
function |/| and the fact that the limit of is one. 
The same argument works for the one sided series 



E 



f(T- n x)g(S n y) 



because for a / which is a Wiener- Wintner function of power type a we also 
have for each N 

1 N 

-]T.f(T-"x)e 2 -"* 



sup 

t 



71=1 



< 



c 



□ 
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With some extra work using a truncation method one can prove the above 
convergence (a.e. v) for functions g € L r (i>) for 1 < r < oo. 

As shown in [Ass04bj not all ergodic systems have such a dense set of func- 
tions in K,- 1 even with a rate as slow as a logarithm. But interesting systems 
like i^-automorphisms do. 

7 Conclusion 

For interested readers, there are several possible directions of study concerning 
return times to consider in addition to those mentioned above. 

For the Cesaro averages one could extend this study to any good averaging 
process. For instance consider an increasing sequence of natural numbers p(n) 
which is good for the pointwise convergence in L p (such as the sequence of 
squares in L p (for 1 < p < oo [BouQO]). 

Question 7.1. Do the averages J2n=i f(T p( - nS> x)g(S n y) converge a.e. vl 
Question 7.2. Do the averages i 5Z n=1 f(T p ^x)g(S p< - n ^y) converge a.e. vl 

Question 7.3. For the previous two questions, what is the range of functions for 
which the result is true? 

Question 7.4. Can one have a multiple term return times result for the averages 
above? 

Question 7.5. What would be the characteristic factors for the averages in ref- 
erenced in the above questions? 

One could also look at the same questions translated to the corresponding 
Hilbert Transform. Note that the notion of characteristic factors is not the same 
for the Hilbert transform and the Cesaro averages. One has to think instead 
in terms of the a.e. continuity of the limit of the series. For instance for the 
averages in Question |7.2| the corresponding averages to study with respect to 
Question |7.5| would be the a.e continuity in t of the series 




'p(n)x p 27rip(n)t 



once the function / is orthogonal to the appropriate factor. 

Some of these questions have been raised by the first author during prob- 
lem sessions at the Ergodic Theory Workshops he has organized yearly at The 
University of North Carolina at Chapel Hill since the summer of 2002. 
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